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Abstract. We present a simple construction of quantum automata which 
achieve an exponential advantage over classical finite automata. Our au- 
tomata use I log2p + 0(1) states to recognize a language that requires 
p states classically. The construction is both substantially simpler and 
achieves a better constant in the front of logp than the previously known 
construction of [5]. 

Similarly to [2], our construction is by a probabilistic argument. We 
consider the possibility to derandomize it and present some results in 
this direction. 

1 Introduction 

Quantum finite automata are a mathematical model for quantum com- 
puters with limited memory. A quantum finite automaton has a finite 
state space and apphes a sequence of transformations, corresponding to 
the letter of the input word to this state space. At the end, the state of 
the quantum automaton is measured and the input word is accepted or 
rejected, depending on the outcome of the measurement. 
Most commonly, finite automata (including quantum finite automata) 
are studied in 1-way model where the transformations corresponding to 
the letters of the input word are applied in the order of the letters in the 
word, from the left to the right. (More general 2-way models [S] allow 
the order of the transformations to depend on the results of the previous 
transformations.) 

For 1-way model (which we consider the most natural model in the quan- 
tum setting), the set of languages (computational problems) that can be 
recognized (computed) by a quantum automaton is the same for classi- 
cal automat£0- However, quantum automata can be exponentially more 
space-efficient than classical automata [2]. This is one of only two results 
that show an exponential advantage for quantum algorithms in space 
complexity. (The other is the recent exponential separation for online 
algorithms by Le Gall [5].) 

Our first result is an improved exponential separation between quantum 
and classical finite automata, for the same computational problem as in 
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[5]. The construction in [2] is quite inefficient. While it produces an ex- 
ample where classical automata require p states and quantum automata 
require Clogp states, the constant C is fairly large. In this paper, we 
provide a new construction with a better constant and, also, a much 
simpler analysis. (A detailed comparison between our results and is 
given in section I3TT] ') 

Second, both construction of QFAs in and this paper are probabilis- 
tic. That is, they employ a sequence of parameters that are chosen at 
random and hardwired into the QFA. In the last section, we give two non- 
probabilistic constructions of QFAs for the same language. The first of 
them gives QFAs with O(logp) states but its correctness is only shown 
by numerical experiments. The second construction gives QFAs with 
C'(Iog'^"'"'^ p) states but is provably correct. 

2 Definitions 

2.1 Quantum finite automata 

We consider 1-way quantum finite automata (QFA) as defined in [10| . 
Namely, a 1-way QFA is a tuple AI = (Q, S, S, qo, Qacc, Qrej) where Q is 
a finite set of states, S is an input alphabet, 5 is a transition function, 
go £ Q is a starting state, Qacc and Qrej are sets of accepting and 
rejecting states and Q = Qacc U Qrej- i and $ are symbols that do 
not belong to 17. We use (f^ and $ as the left and the right endmarker, 
respectively. The working alphabet oi M \s F = E VJ $}. 
A superposition of M is any element of hiQ) (the space of mappings 
from Q to (D with I2 norm). For q £ Q, \q} denotes the unit vector with 
value 1 at g and elsewhere. All elements of hiQ) can be expressed as 
linear combinations of vectors |g). We will use V' to denote elements of 
hiQ). 

The transition function S maps Q x F x Q to C The value S{qi,a,q2) 
is the amplitude of |g2) in the superposition of states to which M goes 
from |gi) after reading a. For a £ F, Va is a, linear transformation on 
hiQ) defined by 

K(|gi)) = ^ <5(gi,a,g2)|g2). (1) 
We require all Va to be unitary. 

The computation of a QFA starts in the superposition |go). Then trans- 
formations corresponding to the left endmarker </,, the letters of the input 
word X and the right endmarker $ are applied. The transformation cor- 
responding to a G _r is just Va- If the superposition before reading a is 
ip, then the superposition after reading a is Va{tp)- 

After reading the right endmarker, the current state ?/) is observed with 
respect to the observable Eacc(B Erej where Eacc = span{\q) : q G Qacc}, 
Erej = span{\q) : q £ Qrej}- This observation gives x £ Ei with the 
probability equal to the square of the projection of to Ei. After that, 
the superposition collapses to this projection. 



If we get -i/j G Eacc, the input is accepted. If -i/) € Erej, the input is 
rejected. 

Another definition of QFAs. Independently of ^Q\, quantum au- 
tomata were introduced in [S]. There is one difference between these 
two definitions. In [S], a QFA is observed after reading each letter (after 
doing each Va)- In [10) . a QFA is observed only after all letters have been 
read. The definition of [S] is more general. But, in this paper, we follow 
the definition of fT01 because it is simpler and sufficient to describe our 
automaton. 



2.2 Unitary transformations 

We use the following theorem from linear algebra. 

Theorem 1. Let ai, . . ., am be such that |aip + . . . + \am.^ = 1. Then, 

1. there is a unitary transformation Ui such that Ui\qi) = ai|gi) + 
. . . + am\qm)- 

2. there is a unitary transformation U2 such that, for all i £ {1, ... , m}, 
U2\qi) is equal to ai\qi) plus some combination of \q2) , . ■ ■, \qm). 

In the second case, we also have 

U2{ai\qi) + . . . + Ofmlgm)) = 

3 Space-efRcient quantum automaton 
3.1 Summary of results 

Let p be a prime. We consider the language Lp = { a' | i is divisible 
by p }. It is easy to see that any deterministic 1-way finite automaton 
recognizing Lp has at least p states. However, there is a much more 
efficient QFA! Namely, Ambainis and Freivalds [2] have shown that Lp 
can be recognized by a QFA with O(logp) states. 

The big-0 constant in this result depends on the required probability of 
correct answer. For x £ Lp, the answer is always correct with probability 
1. For X ^ Lp, [21 give 

— a QFA with 16 log p states that is correct with probability at least 
1/8 on inputs x ^ Lp. 

— a QFA with poZy(i) log p states that is correct with probability at 
least 1 — e on inputs x ^ Lp (where poly{x) is some polynomial in 
x). 

In this paper, we present a simpler construction of QFAs that achieves 
a better big-O constant. 

Theorem 2. For any e > 0, there is a QFA with 4i2g2E gf-ates recogniz- 
ing Lp with probability at least 1 — e. 



3.2 Proof of Theorem H 



Let Uk, for k £ {1, ... ,p — 1}, be a quantum automaton with a set of 
states Q = {qo,qi}, a starting state \qo), Qacc = {go}, Qrej = {qi}- 
The transition function is defined as follows. Reading a maps \qo) to 
cos (fi\qo) + sin (li\qi) and \qi) to — sin (l)\qo) + cos (j>\qi) where = (It 
is easy to check that this transformation is unitary.) Reading and $ 
leaves \qo) and |gi) unchanged. 

Lemma 1 . After reading a-' , the state of Uk is 

cos(^^^ \qo)+sm(^^^ 

Proof. By induction. □ 
If j is divisible by p, then is a multiple of 2tv, cos(22^) = 1, 

sin(2^^) — 0, reading a^ maps the starting state \qo) to |qo)- There- 
fore, we get an accepting state with probability 1. This means that all 
automata Uk accept words in L with probability 1. 

Let ki,. . . ,kd be a sequence of d = clogp numbers. We construct an 
automaton U by combining Uk^, ■ ■ ■, Uk^- The set of states consists of 
2d states gi,o, gi,i, (?2,o, 92,1, • • •, qd,o, qd,i- The starting state is qi^. 
The transformation for left endmarker ^ is such that V^(|gi,o)) = \^o) 
where 

\ipo) = -^(Igi.o) + lg2,o) + . . . \qd,o))- 

This transformation exists by first part of Theorem [T] The transforma- 
tion for a is defined by 

Va{\qi.o)) = COS ^^^lg,;,o) + sin ^^^|gi,i), 
P P 

Va(\qx,i)) = - sm g,;,(,) + cos gi,i). 

P P 

The transformation V$ is as follows. The states \qi,i) are left unchanged. 
On the states |gi,o), V$jgi,o) is "Tjki.o) plus some other state (part 2 of 
Theorem[TJ applied to |gi,o), • • •, \qd,o})- In particular, 

V$\i)o) = Iqi.o). 

The set of accepting states Qacc consists of one state gi,o. AH other states 
qij belong to Qrej- 

Claim. If the input word is a-' and j is divisible by p, then U accepts 
with probability 1. 

Proof. The left endmarker maps the starting state to |i/'o)- Reading j 
letters a maps each |gi,o) to itself (see analysis of Uk). Therefore, the 
state \^o) which consists of various \qi,o) is also mapped to itself. The 
right endmarker maps \ipo} to |gi,o) which is an accepting state. □ 



Claim. If the input word is o^, j not divisible by p, U accepts with 
probability 

cos + cos + ...+COS ^ . 2) 

\ p p P J 

Proof. By Lemma [1] a-' maps |gi,o) to cos \qi,o) + sin 
Therefore, the state before reading the right endmarker $ is 

d 

1 v-^, 2-Kkij . 2-Kkij 
— > (cos \qifi) + sm 

Vrf ^ P P 

i — l 

The right endmarker maps each \qi,o) to --^|gi,o) plus superposition of 
other basis states. Therefore, the state after reading the right endmarker 
$ is 

d 
i — l 

plus other states \qi,j}. Since |gi,o) is the only accepting state, the prob- 
ability of accepting is the square of the coefficient of |5i,o)- This proves 
the lemma. □ 
We use the following theorem from probability theory (variant of Azuma's 
theorem 1 11] ). 

Theorem 3. Let Xi, . . . , Xd be independent random variables such that 
E[Xi] — and the value of Xi is always between -1 and 1. Then, 

Pr[\^X,\ > A] < 2e-^. 

i=l 

We apply this theorem as follows. Fix j G {l,...,p — 1}. Pick each of 
ki, . . . ,kd randomly from {0, . . . ,p—l}. Define Xi = cos . We claim 
that Xi satisfy the conditions of theorem. Obviously, the value of cos 
function is between -1 and 1. The expectation of Xi is 

p-i 

2TTkj 



E[X,] = 1 ^ cos 



P 

fe=0 

since ki = k for each k G {0, . . . ,p — 1} with probability 1/p. We have 
cos — cos p) because cos(27r + x) = cos a;. Consider the 

numbers 0, j, 2j mod p, . . ., (p— l)j mod p. They are all distinct. (Since 
p is prime, kj — fc'j(modp) implies k — k' .) Therefore, the numbers 0, 
j, 2j mod p, . . ., (p— l)j mod p are just 0, 1, . . . ,p—l in a different order. 
This means that the expectation of Xi is 

iJM = iycosM. 



This is equal to 0. 



By equation the probability of accepting is -^{Xi + . . . + Xdf ■ 
To achieve 

we need \Xi + . . . + Xd\ < y/td. By Theorem [S] the probabihty that this 
does not happen is at most 2e "2" . 

There are p — 1 possible inputs not in L: a^, . . ., a*'"'^. The probability 
that one of them gets accepted with probability more than e is at most 
2(p- l)e-T. If 

2(p_l)e-*<l, (3) 

then there is at least one choice oi k\, . . . ,kd for which U does not accept 
any of , . . ., a''~^ with probability more than e. The equation Q is 
true if we take d = 2^-^^. The number of states for U is A^-^^. □ 

4 Explicit constructions of QFAs 

In the previous section, we proved what for every e > and p £ P, 
there is a QFA with 4^2^22 states recognizing Lp with probability at 
least 1 — e. The proposed QFA construction depends on d = 2 
parameters ki, . . . ,kd and accepts input word ^ Lp with probability 

It is possible to choose ki, . . . ,kd values to ensure 
1 / 27rfc 

\ i = l 

or, equivalently. 



< V^d (4) 
for every a-' ^ Lp. 

However, our proof is by a probabilistic argument and does not give an 
explicit sequence k\, . . . ,kd- We now present two constructions of explicit 
sequences. The first construction works well in numerical experiments 
and gives a QFA with O(logp) states in all the cases that we tested. 
The second construction uses a slightly larger number of states but has 
a rigorous proof of correctness. 

4.1 The first construction: cyclic sequences 

We conjecture 

Hypothesis 1 If g is a primitive root modulo p £ P, then sequence 
Sg — {ki = mod pjiLi for all d and all j : a-' ^ Lp satisfies 




d 



2'Kkij 



We will call g a sequence generator: The corresponding sequence will be 
referred as cyclic sequence. We have checked all p £ {2, . . . ,9973}, all 
generators g and all sequence lengths d < p (choosing a corresponding e 
value) and haven't found any counterexample to our hypothesis. 

We now describe numerical experiments comparing two strategies: using 
a random sequence ki, . . . ,kd and using a cyclic sequence. 

We will use Srand to dcuote random sequence and Sg to denote a cyclic 
sequence with generator g. We will also use trand and eg to denote the 
maximum probability with which a corresponding automata accepts in- 
put word a-' ^ Lp. 

Table[T]shows e^and and eg for different p and g values. e,.and is calculated 
as an average over 5000 randomly selected sequences, eg is for one specific 
generator, e in the second column shows the theoretical upper bound 
given by Theorem [5] 



p 


e 


d 


9 


^ra7id 




f523 


0,1 


161 


948 


0,03635 


0,01517 


2689 


0,1 


172 


656 


0,03767 


0,01950 


3671 


0,1 


179 


2134 


0,03803 


0,02122 


4093 


0,1 


181 


772 


0,03822 


0,01803 


5861 


0,1 


188 


2190 


0,03898 


0,01825 


6247 


0,1 


189 


406 


0,03922 


0,02006 


7481 


0,1 


193 


6978 


0,03932 


0,01691 


8581 


0,1 


196 


5567 


0,03942 


0,02057 


9883 


0,1 


198 


1260 


0,04011 


0,01905 



Table 1. Crand and eg for different p and g 



In 99.98% - 99.99% of our experiments, random sequences achieved the 
bound of Theorem [21 Surprisingly, cyclic sequences substantially outper- 
form random ones in almost all the cases. 

More precisely, for randomly selected p £ P, e > and generator g, a 
cyclic sequence 5*9 gives a better result than a random sequence Srand 
in 98.29% of cases. A few random instances are shown in Figure [T] For 
each instance, we show the bound dy^ on (j4]) obtained by a probabilistic 
argument, the maximum of frand{j) (which is defined as the value of Q 
for the sequence Srand) over all j, ^ Lp and the maximum of fg{j) 
(defined in a similar way using Sg instead of Srand)- 

In 1.81% of cases, we got that sup \fg{j)\ > sup | /rand (j)|, where sup \frand 
is calculated as an average over 5000 randomly selected sequences. Fig- 
ure [2] shows one of these cases: p = 9059, e — 0.09 and g = 2689, com- 
paring the cyclic sequence with 9 different randomly chosen sequences. 
The cyclic sequence gives a slightly worse result than most of the ran- 
dom ones, but still beats the probabilistic bound on Q by a substantial 
amount . 




4 I « T 



□ bound = dfT Bsig? I' □aupM^gil 



Fig. 1. sup and sup \frandU)\ for random p, e and , 



Comparing different generators Every p £ P might have muhiple 
generators. Table [2] shows eg values for p — 9059 and e = 0.1 (sequence 
length d = 197, = 62.0101221453601). 



9 


^9 


9 


^9 


9 




102 


0,02533 


1545 


0,01858 


9023 


0,01807 


103 


0,03758 


1546 


0,02235 


9033 


0,01413 


105 


0,01999 


1549 


0,02896 


9034 


0,01485 


106 


0,02852 


1552 


0,02873 


9036 


0,02509 


110 


0,01685 


1553 


0,02624 


9039 


0,02311 



Table 2. eg values for different generators, p = 9059 



Different generators have different eg values. We will use Qmin to refer a 
minimal generator, i.e. one having a minimal eg. Table [3] shows minimal 
generators for p values from table [1] 

We see that, typically, the minimal generators give a QFA with substan- 
tially smaller probability of error. It remains open whether one could 
find a minimal generator without an exhaustive search of all generators. 

4.2 The second construction: AIKPS sequences 

Fix e > 0. Let 

P = {r\r is prime, (logp)^"'''^/2 < r < (logp)^"'"'^}, 
5 = {l,2,...,(logp)^+^^}, 




Fig. 2. sup and sup l/^andC?)! for p = 9059, e = 0.09 and g = 2689 



p 


e 


d 


9 


^9 


Qmin 




1523 


0,1 


161 


948 


0,01517 


624 


0,00919 


2689 


0,1 


172 


656 


0,01950 


1088 


0,01060 


3671 


0,1 


179 


2134 


0,02122 


1243 


0,01121 


4093 


0,1 


181 


772 


0,01803 


1063 


0,01154 


5861 


0,1 


188 


2190 


0,01825 


5732 


0,01133 


6247 


0,1 


189 


406 


0,02006 


97 


0,01182 


7481 


0,1 


193 


6978 


0,01691 


2865 


0,01205 


8581 


0,1 


196 


5567 


0,02057 


4362 


0,01335 


9883 


0,1 


198 


1260 


0,01905 


5675 


0,01319 



Table 3. Minimal generators for different p 



T = {s-r-''\reR,seS}, 
with being the inverse modulo p. Ajtai et al. [I] have shown 
Theorem 4. For all k&{l,...,p- 1}, 

|^^2tWP|<(iogp)-|T|. 

Razborov et al. [12] have shown that powers e^**'^^/" satisfy even stronger 
uniformity conditions. We, however, only need Theorem |4l 
By taking the real part of the left hand side, we get 



teT 



2tkTTi\ 
P J 



<(iogp)-nri. 



Thus, taking our construction of QFAs and using elements of T as 
ki, . . . ,kd gives an explicit construction of a QFA for our language with 
0(log^+^') states. 



For our first, cyclic construction, the best provable result is by applying 
a bound on exponential sums by Bourgain [6]. That gives a QFA with 
0{p'^^ '°^'°^^) states which is weaker than both the numerical results and 
the rigorous construction in this section. 
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